Some Considerations on Universality 



Manfred Kudlek 

Dept. Informatik, MIN Fakultat, Univ. Hamburg 
kudlekOinf ormat ik . uni-hamburg . de 

The paper puts into discussion the concept of universality, in particular for structures not of the 
power of Turing computability. The question arises if for such structures a universal structure of the 
same kind exists or not. For that the construction of universal Turing machines and those with some 
constraints are presented in some detail. 

1 Introduction 

Without the investigation of the concept of universality it is quite possible that our modern computers 
wouldn't exist. Although they are finite automata, at least theoretically, practically they can be treated as 
Turing machines processing any special program which can be seen as a special Turing machine. This 
paper presents some general considerations on that concept, presenting in some more detail universality 
for general Turing machines, deterministic as well as non-deterministic ones. Universal devices not only 
have been considered for Turing machines (TM's), but also for cyclic Post machines (CPM's), cellular 
automata (CA's), random access machines (RAM's), and others. 

Very small universal devices mainly have been constructed for DTM's, CPM's, and RAM's. Until 
recently, the encoding of special deterministic Turing machines (DTM's) for small universal (determin- 
istic) Turing machines (U(D)TM's) was achieved via universal tag systems, needing exponential space 
and time for the simulation. A polynomial amount of space and time was shown in (TJIH. 

Universality has mainly been investigated intensively for the class of Turing computability whereas 
there exist only few publications on universality for restricted systems, having the same restriction. Such 
a restriction is e.g. space or time complexity, or working conditions. Furthermore, the concept of uni- 
versality also should be investigated for other underlying structures than free monoids, e.g. commutative 
monoids or more general structures. Actually the ideas presented here arose from the question if there 
exists a universal (higher order) Petri nets capable of simulating all special Petri nets but not having 
power of Turing computability. 

The concept of universality discussed here is somehow restricted, since there should not just be a 
black box with input special machine (e.g. a DTM) together with its input, and output of the somehow 
simulated result, without considering also e.g. space and time amount of the simulation HID- It should 
also somehow simulate the (local) behaviour (e.g. in space and time). 

To clarify the problem universality for Turing machines will be considered in more detail. 

2 General luring Machines 

For general Turing machines (TM's), in particular for the deterministic version (DTM), there exists the 
concept of universality. This means that a single DTM can be constructed which can simulate any 
special DTM in a certain way. For that purpose it is necessary to encode a DTM as well as its input. 
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Since the encodings have to be invertible, the corresponding encoding functions have to be injective, and 
the encoding as well as decoding functions have to be computable by DTM's. 

Formally, assume an enumerable universal alphabet E, such that that any special DTM M has a 
finite input and work alphabet L M C E. Similarly, assume an enumerable universal set of states Q with 
Q n E = 0, such that any special DTM M has a finite set of states Qm C Q. In both cases, E and Q are 
equivalent to N. It can be also assumed that Q = {qi,- • • ,1\q m \}, and that the initial state of M is q\ and 
the final state q\Q M \. Furthermore, that by renaming the symbols of M are Em = {s\, ■ ■ ■ 

Any DTM M defines a local function /m given by the program of M, affecting the configurations of 
M. Those can be written as wiqxwR where wl,wr G E m , x G Em, and q G Qm- An initial configuration 
has the form c\ = wiqiWR, and a final configuration the form WLq\Q u \WR, where wl G E m and wr G E m . 
Quite often also a normal form c\ = q\w, and cp = q\Q M \w, respectively, is used where w G E M . A step 
of M can then be expressed by /m(wl^wr) = w' L q'x'w' R , with the usual interpretation of new symbol, 
new state and movement. Thus /m '■ L m Qm^ m — ► E m <2mE m - Im can be extended to a, in general partial, 
function f M : E M £ M E M -> E M 2 M E M by the definition 

fmiwiqiWR) = w' L q\Q M \w' R <^> 3?i G N : /mO^i^s) = w L^|eM| w K- This also defines a function 
/m : E M -> E M by (w = w L w R and w' = w' L w' R ) f M {w) = w' 44> /m( w £<7i w #) = w L^|e M | w R- For the above 
mentioned normal form this gives fli{q\w) = q\Q M \w' 44> 3nSN: fjwiqiw) = q\Q M \w' and 

f M ( w ) = w >o fZf{q\w) = q\ Qu \w'. 

For a universal DTM U the program as well as the configurations of special DTM M have to be 
encoded. Simple encoding and decoding functions should be used for that purpose. They have to be 
injective and computable by DTM's *Fm and <I>m, as well as the inverses by some DTM's *¥' M and 
<t>' M . Actually, these DTM are related to M since some information on Qm and Em is necessary. Let 
E[/ C E be the alphabet, and Qu C Q the set of states of U. One possibility of encoding are functions 
fy> M : (E M U Q m ) -> (L v \ {#})+ and : (E M U Q M ) -> (Ej/ \ {#})+ where # G L v is a special separator. 

The initial and final configurations of U for simulating M can be chosen in a normal form as 
q\xfy> M {P M )#yU M {ci) and q\Q u \xh M {PM)#yU M (cF) where x,? G L v are special symbols indicating the 
current position in the program Pm and configuration cm, Pm is the program of M, and # G E is a sepa- 
rator. Both functions, fe M and f& M are injective. The configurations of U simulating M have one of the 
forms 

uiquRXu' R #viyvR, uixu R qu' R #vLyvR, uixuR#viqv' L yvR, or 
ULXUR#VLxV L qvR, respectively. 

For the simulation the following condition for one step cm — > c' M of M, or reaching a final configura- 
tion from the the initial configuration of M holds (here for normal forms): 
VMVc m 3tiGN : f M {qxfc M {P M )#yfa M {cM)) = q'xfo M (PM)#yf<t> M (c' M ) 

or 

VMBrcGN : fu(qixh> M (P M )#yf<s> M (c\)) = q\ Qu \xfy M (PM)#yfy M (cF). 

Often {0, 1} C Ej/ is chosen for the encoding alphabet. Em, Qm, Pm, and configurations cm are en- 
coded either in unary or binary. Then Sj can be encoded by V in unary or by bin(i) in binary. Analogously 
for qj. A configuration ■ ■■Si e qjSi (+l ■ --Si r can be encoded e.g. by 

T'O- • • r' f 00y'0r' f+1 0- • • r''0 in unary or by 

bin{i\ )a--- bin{i?)cbin{j)cbin{it + \ )a--- bin{i r )a in binary where a, b are separating symbols. Encod- 
ing 0, l,a,c e.g. by 00,01, 10, 1 1 then gives a binary encoding using only 0, 1. 

The unary encoding needs |w|(|Em| + 1) + \Qm\ +4 space, whereas the binary encoding needs 
2{\w\{\log 2 ^M\ + 1) + \logiQM~] +2) space. 



120 



Some Considerations on Universality 



Encoding and also decoding can be achieved by very simple DTM's, actually by deterministic fi- 
nite state transducers (DFST's) consuming linear time. For binary encoding the transducer also needs 
information on the sizes \Lm\ and \Qm\- 

A special class of TM is the class of TM with alphabet {0, 1}. In that case only states have to be 
encoded. Thus the bounds are 3|w| + \Qm\ +4 or 4|w| +2[/o§2|2m|1 + 4- 

The considerations so far also hold for non-deterministic TM's (NTM's). Encodings again can be 
achieved by DTM's. Instead of functions one has to consider binary relations between configurations. 
The universal NTM works like the deterministic counterpart, with the difference that because of the 
encoding of the non-deterministic program Pm the universal NTM has choices for its steps. 

It is also an interesting aspect to look for small universal NTM. 

3 Turing Machines with Constraint 

In the case of TM's with some constraint the DTM for encoding and decoding should be restricted to 
the same constraint. Such a constraint is e.g. space or time complexity. But it could be also reversibility 
or some property of the function or relation defined by the TM. In any case, besides alphabet and set of 
states of the special TM also the complexity function has to be encoded. 

In O it has been shown that for the class of DTM's with a binary alphabet {0, 1} and a space 
complexity function belonging to a subclass of primitive recursive functions over 1 variable, there exists 
a universal DTM with the same complexity constraint. Note that only one variable is needed for the 
computation of the complexity function. Examples of such primitive recursive complexity functions are 
g(x) = c-x, g(x) =c-x k , g(x) = c ■ 2 X , g{x) = c ■ 2 y , polynomials with non-negative coefficients 

k 

P( x ) = Yi c *^~ ixi G N ' 

i=i 

g(x) = 2 p W where p(x) is a polynomial as just defined. 

This subclass of primitive recursive functions g does not use projections and has the property: 

k 

3c g eNVieN* : c g + g{x) > £xj , 

i=i 

which in the case of 1 variable results in c g + g(x) > x. 
Thus g(x) = \log2(x)~\ does not belong to this subclass. 

If r g is the length of the representation of the space complexity function g then the universal DTM 
needs only s g (x) = g(x) + r g space. 

For the analogous time complexity constraint there is a price of polynomial cost. In this case there is 
a polynomial p g with non-negative coefficients for every primitive recursive function from the subclass 
defined above such that the time complexity of the universal DTM fulfills t g (x) < p g {g{x)) for all x € N. 

Since the space and time complexity functions are computed by DTM's it follows by straight forward 
arguments that the results from above also hold for non-deterministic TM's (NTM's), i.e. there exists a 
universal NTM for the subclass of primitive recursive functions as complexity constraints. 

Thus there exist universal DTM's and NTM's with complexity constraints for each complexity func- 
tion from the special class of primitive recursive functions (for time complexity only with additional 
polynomial). In particular, there exist universal (N)LBA's and DLBA's. 
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4 Finite Automata and Finite State Transducers 

In the case of deterministic and non-deterministic finite automata (DFA's, NFA's), as well as deterministic 
and non-deterministic finite state transducers (DFST's, NFST's) the encoding and decoding of special FA 
and FST should be achieved by DFST's. Otherwise, the encoding might contain too much computation 
power, going even beyond the power of FAs or FST's. 

In it was shown that under the above condition that encoding of special FAs and FST's has 
to be achieved by a DSFT, there exists no universal DFA, NFA, DFST and NFST The proof uses a 
contradiction on the number of states of such a universal automaton. 

Thus there are no universal FAs in this strong sense. 

In the same paper (10) it has also been shown that there exists a universal FA for the class of all 
FAs with a bounded number k of states. However, that universal FA does not belong to the same class, 
needing strictly more states. 

5 Pushdown Automata and Pushdown Transducers 

For universal DPDA's and NPDA's, as well as for corresponding transducers (DPDT's and NPDT's), the 
encoding of special DPDA and NPDA should be done by a DPDT. From the considerations above for 
Turing machines it follows that deterministic finite state transducers suffice for encoding and decoding. It 
is unknown that under such conditions there exist universal automata of that kind. But it is my conjecture 
that such do not exist, since there is a problem in accessing the program of a special PDA. Possibly this 
can be shown using Kolmogorov complexity arguments. 

6 Rewriting Systems 

In the case of word grammars or other word rewriting systems like L-systems, it is necessary to use a 
deterministic or monogeneous grammar or rewriting system of the same type for encoding the special 
ones. 

For general semi Thue systems (STS) or type-0 grammars this is straight forward since any such 
system can be simulated by a TM, and vice versa. In a similar way, type-1 or monotone frammars can 
be simulated by NLBA's and vice versa. 

For type-2 grammars, however, my conjecture is that there does not exist a universal grammar of 
that type. Here the encoding should be achieved by some kind of deterministic or monogeneous type-2 
grammar used as transducer. The same seems to hold for ETOL systems where encoding should be 
done by an EDTOL system used as transducer. An ETOL system O consists of a finite number of 
not necessarily disjoint finite sets of context-independent rules which in one derivation step have to be 
applied in parallel to all symbols in a word using only, not necessarily identical rules, of one set. In an 
EDTOL system there is only one rule for each symbol. 

7 Multiset Grammars 

Whereas for word grammars the encoding of special grammars can be done by a FST it is a problem 
for multiset grammars, at least for simple multisets as elements from N*. This arises when looking for 
a universal vector addition system (VAS) or a universal Petri net (PN). Most probably, such don't exist, 
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at least for simple multisets, i.e. elements from N k . The reason for that possibly is the commutativity of 
the basic underlying operation ©, whereas for word systems catenation is not commutative. 

Possibly, higher order multisets, used as tokens in higher order Petri nets, might be a solution for the 
construction of universal higher order Petri nets. However, they should not be of the power of Turing 
machines. 

8 General Universality 

A possible definition of general universality to cover e.g. all the cases considered so far, may be the 
following construction. 

Let be a universe of elements, equipped with a binary operation o and M a collection (class) of 
binary relations r C jft r x with the property o : x jjt r — > M r for r G M. 

To define a universal relation R for M encoding of elements x G and of r itself is necessary. This 
might be achieved by injective functions f r : — » C and g : M — > M R from the same class 
such that 

Vr G &Mx,y G ^# r 3?i G N : (g(r) of r (x),g(r) o f r (y)) G R" ^ (x,y) G r, where = ^ U -^R 
and n = 0, and R C ^ R x 

But there is a problem about the operation o since actually it is a ternary relation on 
More general, with a single pairing function (j> : ^ x <M —>■ .Jt R , the condition is 

Vr G ^x,y G ^ r 3n G N : (g(r),$(f r (x),f r (y)) G /?" ^ (x,y) G r. 

Another possibility may be to choose as basic universe a Scott domain as for the construction of a 
model for the A -calculus. 
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